Overview

Dataset statistics

Number of variables: 40
Number of observations: 13351
Missing cells: 2671
Missing cells (%): 0.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 MiB
Average record size in memory: 117.0 B
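
The missing-cell percentage can be reproduced from the counts in this section: 2671 missing cells out of 40 × 13351 cells in total. A minimal check, using only the figures reported here (the variable names are illustrative):

```python
# Figures copied from the dataset statistics in this section.
n_variables = 40
n_observations = 13351
missing_cells = 2671

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Almost all of the 2671 missing cells come from a single variable: the alerts list 2670 missing values for Price.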

Variable types

Numeric: 8
Categorical: 32

Alerts

Year has constant value "2019" [Constant]
df_index is highly correlated with Year [High correlation]
Price is highly correlated with Airline_5 and 1 other fields [High correlation]
Date is highly correlated with Year [High correlation]
Arrival_Hour is highly correlated with Arrival_Min and 2 other fields [High correlation]
Arrival_Min is highly correlated with Arrival_Hour [High correlation]
Dep_Hour is highly correlated with Arrival_Hour [High correlation]
Dep_Min is highly correlated with Year [High correlation]
Duration_in_minutes is highly correlated with Total_Stops and 5 other fields [High correlation]
Total_Stops is highly correlated with Duration_in_minutes [High correlation]
Month is highly correlated with Destination_5 [High correlation]
Year is highly correlated with Source_2 and 30 other fields [High correlation]
Airline_1 is highly correlated with Year [High correlation]
Airline_2 is highly correlated with Year [High correlation]
Airline_3 is highly correlated with Duration_in_minutes and 1 other fields [High correlation]
Airline_4 is highly correlated with Arrival_Hour and 3 other fields [High correlation]
Airline_5 is highly correlated with Price and 1 other fields [High correlation]
Airline_6 is highly correlated with Duration_in_minutes and 2 other fields [High correlation]
Airline_7 is highly correlated with Year [High correlation]
Airline_8 is highly correlated with Additional_Info_7 [High correlation]
Airline_9 is highly correlated with Year [High correlation]
Airline_10 is highly correlated with Year [High correlation]
Airline_11 is highly correlated with Year [High correlation]
Source_1 is highly correlated with Destination_4 [High correlation]
Source_2 is highly correlated with Duration_in_minutes and 3 other fields [High correlation]
Source_3 is highly correlated with Source_2 and 1 other fields [High correlation]
Source_4 is highly correlated with Destination_3 [High correlation]
Destination_1 is highly correlated with Duration_in_minutes and 3 other fields [High correlation]
Destination_2 is highly correlated with Duration_in_minutes [High correlation]
Destination_3 is highly correlated with Source_4 [High correlation]
Destination_4 is highly correlated with Source_1 [High correlation]
Destination_5 is highly correlated with Month [High correlation]
Additional_Info_1 is highly correlated with Year [High correlation]
Additional_Info_2 is highly correlated with Year [High correlation]
Additional_Info_3 is highly correlated with Price and 1 other fields [High correlation]
Additional_Info_4 is highly correlated with Year [High correlation]
Additional_Info_5 is highly correlated with Airline_4 and 1 other fields [High correlation]
Additional_Info_6 is highly correlated with Year [High correlation]
Additional_Info_7 is highly correlated with Airline_8 and 1 other fields [High correlation]
Additional_Info_8 is highly correlated with Airline_4 and 2 other fields [High correlation]
Additional_Info_9 is highly correlated with Year [High correlation]
Price has 2670 (20.0%) missing values [Missing]
Arrival_Hour has 411 (3.1%) zeros [Zeros]
Arrival_Min has 1828 (13.7%) zeros [Zeros]
Dep_Min has 2590 (19.4%) zeros [Zeros]
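
The Constant, Missing, and Zeros alerts can be checked directly in pandas. The sketch below is not how pandas-profiling computes them internally; it is a minimal illustration on a toy frame whose values are made up and whose column names are borrowed from the alerts. Because Year is constant, the many correlation alerts that mention it are probably not meaningful, so dropping it first is a reasonable step.

```python
import numpy as np
import pandas as pd

# Toy frame standing in for the profiled data (all values are made up).
df = pd.DataFrame(
    {
        "Year": ["2019", "2019", "2019", "2019"],
        "Price": [3897.0, np.nan, 13882.0, 6218.0],
        "Dep_Min": [0, 50, 25, 0],
    }
)

# Constant alert: a column with a single distinct value carries no information.
constant_cols = [c for c in df.columns if df[c].nunique(dropna=False) == 1]

# Missing alert: share of missing values per column, in percent.
missing_pct = df.isna().mean() * 100

# Zeros alert: share of exact zeros per numeric column, in percent.
numeric = df.select_dtypes(include="number")
zeros_pct = (numeric == 0).mean() * 100

print("constant:", constant_cols)
print("missing %:", missing_pct.round(1).to_dict())
print("zeros %:", zeros_pct.round(1).to_dict())

# Dropping the constant column before any correlation analysis.
df_reduced = df.drop(columns=constant_cols)
```

For hour and minute columns a zero is a legitimate value (midnight, or a time on the hour), so the Zeros alerts here do not by themselves indicate a data problem.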

Reproduction

Analysis started: 2022-11-01 12:21:55.327707
Analysis finished: 2022-11-01 12:22:22.285344
Duration: 26.96 seconds
Software version: pandas-profiling v3.4.0
Download configuration: config.json
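
The reported duration is the difference between the two timestamps. A short check with the standard library, using the timestamps exactly as printed in this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2022-11-01 12:21:55.327707")
finished = datetime.fromisoformat("2022-11-01 12:22:22.285344")

# Subtracting two datetimes yields a timedelta.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```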

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 10681
Distinct (%): 80.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4539.876713
Minimum: 0
Maximum: 10682
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 333.5
Q1: 1668.5
Median: 4006
Q3: 7344.5
95th percentile: 10014.5
Maximum: 10682
Range: 10682
Interquartile range (IQR): 5676

Descriptive statistics

Standard deviation: 3208.941881
Coefficient of variation (CV): 0.7068345869
Kurtosis: -1.222304742
Mean: 4539.876713
Median Absolute Deviation (MAD): 2671
Skewness: 0.3321333024
Sum: 60611894
Variance: 10297308
Monotonicity: Not monotonic
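
Several of the df_index statistics are derived from the others. The sketch below recomputes them from the reported figures, assuming the usual definitions (CV as standard deviation divided by mean, variance as the squared standard deviation, IQR as Q3 minus Q1); the variable names are illustrative.

```python
# Figures copied from the df_index statistics.
n = 13351
mean = 4539.876713
std = 3208.941881
q1, q3 = 1668.5, 7344.5
minimum, maximum = 0, 10682

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
total = mean * n                  # sum of all values (no values are missing)

print(f"CV       = {cv:.10f}")
print(f"variance = {variance:.0f}")
print(f"IQR      = {iqr}")
print(f"range    = {value_range}")
print(f"sum      = {total:.0f}")
```

Each result agrees with the value in the report up to rounding of the printed inputs.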
Histogram with fixed size bins (bins=50)

Common values

Value                   Count    Frequency (%)
0                       2        < 0.1%
1774                    2        < 0.1%
1776                    2        < 0.1%
1777                    2        < 0.1%
1778                    2        < 0.1%
1779                    2        < 0.1%
1780                    2        < 0.1%
1781                    2        < 0.1%
1782                    2        < 0.1%
1783                    2        < 0.1%
Other values (10671)    13331    99.9%

Minimum 10 values

Value    Count    Frequency (%)
0        2        < 0.1%
1        2        < 0.1%
2        2        < 0.1%
3        2        < 0.1%
4        2        < 0.1%
5        2        < 0.1%
6        2        < 0.1%
7        2        < 0.1%
8        2        < 0.1%
9        2        < 0.1%

Maximum 10 values

Value    Count    Frequency (%)
10682    1        < 0.1%
10681    1        < 0.1%
10680    1        < 0.1%
10679    1        < 0.1%
10678    1        < 0.1%
10677    1        < 0.1%
10676    1        < 0.1%
10675    1        < 0.1%
10674    1        < 0.1%
10673    1        < 0.1%

Total_Stops
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 1
Missing (%): < 0.1%
Memory size: 104.4 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 40050
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.0
2nd row: 2.0
3rd row: 2.0
4th row: 1.0
5th row: 1.0
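
The sample rows show stop counts rendered as "0.0", "1.0", "2.0", which suggests the column holds floats or float-like strings rather than integers. If integer stop counts are wanted, one option is pandas' nullable integer dtype, which keeps the single missing value. This is a sketch on made-up toy values, not a step taken from the report:

```python
import pandas as pd

# Toy values in the same format as the sample rows, plus one missing entry.
total_stops = pd.Series(["0.0", "2.0", "2.0", "1.0", None], name="Total_Stops")

# to_numeric parses strings (or passes floats through) as float64 with NaN;
# "Int64" is the nullable integer dtype, so the missing entry becomes <NA>.
stops_int = pd.to_numeric(total_stops).astype("Int64")

print(stops_int.dtype)
print(stops_int.dropna().tolist())
print("missing:", int(stops_int.isna().sum()))
```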

Common Values

Value        Count    Frequency (%)
1.0          7056     52.8%
0.0          4340     32.5%
2.0          1896     14.2%
3.0          56       0.4%
4.0          2        < 0.1%
(Missing)    1        < 0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

Value    Count    Frequency (%)
1.0      7056     52.9%
0.0      4340     32.5%
2.0      1896     14.2%
3.0      56       0.4%
4.0      2        < 0.1%

Most occurring characters

Character    Count    Frequency (%)
0            17690    44.2%
.            13350    33.3%
1            7056     17.6%
2            1896     4.7%
3            56       0.1%
4            2        < 0.1%

Most occurring categories

Category             Count    Frequency (%)
Decimal Number       26700    66.7%
Other Punctuation    13350    33.3%

Most frequent character per category

Decimal Number
Character    Count    Frequency (%)
0            17690    66.3%
1            7056     26.4%
2            1896     7.1%
3            56       0.2%
4            2        < 0.1%

Other Punctuation
Character    Count    Frequency (%)
.            13350    100.0%

Most occurring scripts

Script    Count    Frequency (%)
Common    40050    100.0%

Most frequent character per script

Common: same counts and percentages as "Most occurring characters".

Most occurring blocks

Block    Count    Frequency (%)
ASCII    40050    100.0%

Most frequent character per block

ASCII: same counts and percentages as "Most occurring characters".

Price
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct: 1870
Distinct (%): 17.5%
Missing: 2670
Missing (%): 20.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9085.898979
Minimum: 1759
Maximum: 79512
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 1759
5th percentile: 3543
Q1: 5277
Median: 8372
Q3: 12373
95th percentile: 15764
Maximum: 79512
Range: 77753
Interquartile range (IQR): 7096

Descriptive statistics

Standard deviation: 4610.92195
Coefficient of variation (CV): 0.5074810935
Kurtosis: 13.31338345
Mean: 9085.898979
Median Absolute Deviation (MAD): 3382
Skewness: 1.813560352
Sum: 97046487
Variance: 21260601.22
Monotonicity: Not monotonic
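
The Price percentages use two different denominators, which is easy to miss. The check below, based only on the reported figures, shows that Missing (%) is taken over all rows, while Distinct (%) and the mean are consistent with being taken over the non-missing rows only. The variable names are illustrative.

```python
# Figures copied from the Price statistics.
n_rows = 13351
n_missing = 2670
n_distinct = 1870
total = 97046487          # reported Sum

n_present = n_rows - n_missing                # rows that have a Price

missing_pct = 100 * n_missing / n_rows        # denominator: all rows
distinct_pct = 100 * n_distinct / n_present   # denominator: non-missing rows
mean = total / n_present                      # mean ignores missing values

print(f"rows with a price: {n_present}")
print(f"missing:  {missing_pct:.1f}%")
print(f"distinct: {distinct_pct:.1f}%")
print(f"mean:     {mean:.6f}")
```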
Histogram with fixed size bins (bins=50)

Common values

Value                  Count    Frequency (%)
10262                  258      1.9%
10844                  212      1.6%
7229                   162      1.2%
4804                   160      1.2%
4823                   131      1.0%
14714                  109      0.8%
3943                   104      0.8%
15129                  93       0.7%
3841                   91       0.7%
12898                  86       0.6%
Other values (1860)    9275     69.5%
(Missing)              2670     20.0%

Minimum 10 values

Value    Count    Frequency (%)
1759     4        < 0.1%
1840     1        < 0.1%
1965     36       0.3%
2017     35       0.3%
2050     10       0.1%
2071     6        < 0.1%
2175     7        0.1%
2227     40       0.3%
2228     9        0.1%
2385     6        < 0.1%

Maximum 10 values

Value    Count    Frequency (%)
79512    1        < 0.1%
62427    1        < 0.1%
57209    1        < 0.1%
54826    3        < 0.1%
52285    1        < 0.1%
52229    1        < 0.1%
46490    1        < 0.1%
36983    1        < 0.1%
36235    2        < 0.1%
35185    1        < 0.1%

Date
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.39060745
Minimum: 1
Maximum: 27
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 6
Median: 12
Q3: 21
95th percentile: 27
Maximum: 27
Range: 26
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 8.439748928
Coefficient of variation (CV): 0.6302737917
Kurtosis: -1.259719504
Mean: 13.39060745
Median Absolute Deviation (MAD): 6
Skewness: 0.1349193501
Sum: 178778
Variance: 71.22936196
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=10)

Common values

Value    Count    Frequency (%)
9        1769     13.2%
6        1626     12.2%
21       1368     10.2%
27       1350     10.1%
1        1349     10.1%
24       1307     9.8%
15       1251     9.4%
12       1212     9.1%
3        1083     8.1%
18       1036     7.8%

Minimum 10 values

Value    Count    Frequency (%)
1        1349     10.1%
3        1083     8.1%
6        1626     12.2%
9        1769     13.2%
12       1212     9.1%
15       1251     9.4%
18       1036     7.8%
21       1368     10.2%
24       1307     9.8%
27       1350     10.1%

Maximum 10 values

Value    Count    Frequency (%)
27       1350     10.1%
24       1307     9.8%
21       1368     10.2%
18       1036     7.8%
15       1251     9.4%
12       1212     9.1%
9        1769     13.2%
6        1626     12.2%
3        1083     8.1%
1        1349     10.1%

Month
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 5
3rd row: 6
4th row: 5
5th row: 3

Common Values

Value    Count    Frequency (%)
5        4329     32.4%
6        4285     32.1%
3        3410     25.5%
4        1327     9.9%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Year
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 53404
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value    Count    Frequency (%)
2019     13351    100.0%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters

Character    Count    Frequency (%)
2            13351    25.0%
0            13351    25.0%
1            13351    25.0%
9            13351    25.0%

The per-category, per-script, and per-block character tables repeat these counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    53404    100.0%
Script      Common            53404    100.0%
Block       ASCII             53404    100.0%

Arrival_Hour
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.3957756
Minimum: 0
Maximum: 23
Zeros: 411
Zeros (%): 3.1%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 8
Median: 14
Q3: 19
95th percentile: 22
Maximum: 23
Range: 23
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 6.896835956
Coefficient of variation (CV): 0.5148515594
Kurtosis: -1.077634955
Mean: 13.3957756
Median Absolute Deviation (MAD): 5
Skewness: -0.3844453106
Sum: 178847
Variance: 47.5663462
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=24)

Common values

Value                Count    Frequency (%)
19                   2057     15.4%
12                   1094     8.2%
4                    1013     7.6%
21                   898      6.7%
22                   837      6.3%
1                    688      5.2%
18                   640      4.8%
23                   608      4.6%
8                    594      4.4%
10                   593      4.4%
Other values (14)    4329     32.4%

Minimum 10 values

Value    Count    Frequency (%)
0        411      3.1%
1        688      5.2%
2        92       0.7%
3        61       0.5%
4        1013     7.6%
5        95       0.7%
6        64       0.5%
7        518      3.9%
8        594      4.4%
9        591      4.4%

Maximum 10 values

Value    Count    Frequency (%)
23       608      4.6%
22       837      6.3%
21       898      6.7%
20       489      3.7%
19       2057     15.4%
18       640      4.8%
17       242      1.8%
16       447      3.3%
15       222      1.7%
14       360      2.7%

Arrival_Min
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.65882705
Minimum: 0
Maximum: 55
Zeros: 1828
Zeros (%): 13.7%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 10
Median: 25
Q3: 35
95th percentile: 50
Maximum: 55
Range: 55
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 16.5571781
Coefficient of variation (CV): 0.6714503517
Kurtosis: -1.038476359
Mean: 24.65882705
Median Absolute Deviation (MAD): 10
Skewness: 0.1118113877
Sum: 329220
Variance: 274.1401466
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=12)

Common values

Value               Count    Frequency (%)
0                   1828     13.7%
15                  1612     12.1%
25                  1599     12.0%
35                  1364     10.2%
20                  1106     8.3%
30                  1062     8.0%
50                  935      7.0%
45                  889      6.7%
5                   839      6.3%
40                  785      5.9%
Other values (2)    1332     10.0%

Minimum 10 values

Value    Count    Frequency (%)
0        1828     13.7%
5        839      6.3%
10       717      5.4%
15       1612     12.1%
20       1106     8.3%
25       1599     12.0%
30       1062     8.0%
35       1364     10.2%
40       785      5.9%
45       889      6.7%

Maximum 10 values

Value    Count    Frequency (%)
55       615      4.6%
50       935      7.0%
45       889      6.7%
40       785      5.9%
35       1364     10.2%
30       1062     8.0%
25       1599     12.0%
20       1106     8.3%
15       1612     12.1%
10       717      5.4%

Dep_Hour
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.51299528
Minimum: 0
Maximum: 23
Zeros: 51
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 5
Q1: 8
Median: 11
Q3: 18
95th percentile: 22
Maximum: 23
Range: 23
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 5.736678072
Coefficient of variation (CV): 0.4584576229
Kurtosis: -1.197845565
Mean: 12.51299528
Median Absolute Deviation (MAD): 5
Skewness: 0.1092079702
Sum: 167061
Variance: 32.9094753
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=24)

Common values

Value                Count    Frequency (%)
9                    1151     8.6%
7                    1067     8.0%
8                    872      6.5%
6                    863      6.5%
17                   847      6.3%
20                   826      6.2%
5                    776      5.8%
11                   714      5.3%
19                   710      5.3%
10                   677      5.1%
Other values (14)    4848     36.3%

Minimum 10 values

Value    Count    Frequency (%)
0        51       0.4%
1        44       0.3%
2        228      1.7%
3        30       0.2%
4        219      1.6%
5        776      5.8%
6        863      6.5%
7        1067     8.0%
8        872      6.5%
9        1151     8.6%

Maximum 10 values

Value    Count    Frequency (%)
23       189      1.4%
22       486      3.6%
21       625      4.7%
20       826      6.2%
19       710      5.3%
18       553      4.1%
17       847      6.3%
16       602      4.5%
15       431      3.2%
14       647      4.8%

Dep_Min
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.50265898
Minimum: 0
Maximum: 55
Zeros: 2590
Zeros (%): 19.4%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 5
Median: 25
Q3: 40
95th percentile: 55
Maximum: 55
Range: 55
Interquartile range (IQR): 35

Descriptive statistics

Standard deviation: 18.83169624
Coefficient of variation (CV): 0.7685572515
Kurtosis: -1.304474933
Mean: 24.50265898
Median Absolute Deviation (MAD): 20
Skewness: 0.1597992843
Sum: 327135
Variance: 354.6327832
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=12)

Common values

Value               Count    Frequency (%)
0                   2590     19.4%
30                  1491     11.2%
55                  1332     10.0%
45                  1106     8.3%
10                  1099     8.2%
5                   951      7.1%
15                  876      6.6%
25                  864      6.5%
20                  819      6.1%
35                  812      6.1%
Other values (2)    1411     10.6%

Minimum 10 values

Value    Count    Frequency (%)
0        2590     19.4%
5        951      7.1%
10       1099     8.2%
15       876      6.6%
20       819      6.1%
25       864      6.5%
30       1491     11.2%
35       812      6.1%
40       646      4.8%
45       1106     8.3%

Maximum 10 values

Value    Count    Frequency (%)
55       1332     10.0%
50       765      5.7%
45       1106     8.3%
40       646      4.8%
35       812      6.1%
30       1491     11.2%
25       864      6.5%
20       819      6.1%
15       876      6.6%
10       1099     8.2%

Duration_in_minutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 373
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 642.4451352
Minimum: 75
Maximum: 2860
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 104.4 KiB

Quantile statistics

Minimum: 75
5th percentile: 90
Q1: 175
Median: 520
Q3: 930
95th percentile: 1615
Maximum: 2860
Range: 2785
Interquartile range (IQR): 755

Descriptive statistics

Standard deviation: 506.6412684
Coefficient of variation (CV): 0.788614063
Kurtosis: -0.1417844086
Mean: 642.4451352
Median Absolute Deviation (MAD): 350
Skewness: 0.8680851863
Sum: 8577285
Variance: 256685.3748
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value                 Count    Frequency (%)
170                   672      5.0%
90                    493      3.7%
165                   432      3.2%
175                   418      3.1%
155                   399      3.0%
180                   333      2.5%
140                   286      2.1%
150                   278      2.1%
160                   196      1.5%
135                   164      1.2%
Other values (363)    9680     72.5%

Minimum 10 values

Value    Count    Frequency (%)
75       30       0.2%
80       81       0.6%
85       159      1.2%
90       493      3.7%
95       22       0.2%
135      164      1.2%
140      286      2.1%
145      122      0.9%
150      278      2.1%
155      399      3.0%

Maximum 10 values

Value    Count    Frequency (%)
2860     1        < 0.1%
2820     1        < 0.1%
2565     1        < 0.1%
2525     1        < 0.1%
2480     1        < 0.1%
2440     1        < 0.1%
2420     1        < 0.1%
2345     3        < 0.1%
2315     5        < 0.1%
2300     6        < 0.1%

Airline_1
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        11161    83.6%
1        2190     16.4%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_2
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        13111    98.2%
1        240      1.8%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_3
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 0
3rd row: 0
4th row: 1
5th row: 1

Common Values

Value    Count    Frequency (%)
0        10787    80.8%
1        2564     19.2%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_4
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 1
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        8606     64.5%
1        4745     35.5%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_5
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        13343    99.9%
1        8        0.1%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_6
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        11808    88.4%
1        1543     11.6%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_7
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        13335    99.9%
1        16       0.1%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_8
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        12325    92.3%
1        1026     7.7%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_9
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        13350    > 99.9%
1        1        < 0.1%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_10
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        12743    95.4%
1        608      4.6%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Airline_11
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        13346    > 99.9%
1        5        < 0.1%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%
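
The Airline_1 to Airline_11 columns look like indicator (one-hot) columns for a single airline field. The report does not say how they were built, so the sketch below treats mutual exclusivity as an assumption and simply adds up the reported counts of value 1. Under that assumption, any row with no flag set would belong to a category that has no column of its own, for example a dropped reference level.

```python
# Count of rows with value 1 in each Airline_k column, copied from the
# Airline_1 to Airline_11 sections of this report.
ones = {
    "Airline_1": 2190,
    "Airline_2": 240,
    "Airline_3": 2564,
    "Airline_4": 4745,
    "Airline_5": 8,
    "Airline_6": 1543,
    "Airline_7": 16,
    "Airline_8": 1026,
    "Airline_9": 1,
    "Airline_10": 608,
    "Airline_11": 5,
}
n_rows = 13351

# Assumption: at most one Airline_k flag is set per row.
flagged = sum(ones.values())
unflagged = n_rows - flagged

print(f"rows with a flag:    {flagged}")
print(f"rows without a flag: {unflagged} ({100 * unflagged / n_rows:.1f}%)")
```

Columns such as Airline_9 (one row) and Airline_11 (five rows) are nearly constant, which is worth keeping in mind when reading their correlation alerts.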

Source_1
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value    Count    Frequency (%)
0        12895    96.6%
1        456      3.4%

Plots: histogram of category lengths; category frequency plot (same counts as Common Values).

Most occurring characters: identical to Common Values, because every value is a single character. The per-category, per-script, and per-block character tables repeat the same counts.

Most occurring categories, scripts, and blocks

Grouping    Name              Count    Frequency (%)
Category    Decimal Number    13351    100.0%
Script      Common            13351    100.0%
Block       ASCII             13351    100.0%

Source_2
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 1
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      7670   57.4%
1      5681   42.6%

Most occurring characters

Value  Count  Frequency (%)
0      7670   57.4%
1      5681   42.6%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Source_3
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 0
4th row: 1
5th row: 0

Common Values

Value  Count  Frequency (%)
0      9770   73.2%
1      3581   26.8%

Most occurring characters

Value  Count  Frequency (%)
0      9770   73.2%
1      3581   26.8%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Source_4
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      12470  93.4%
1      881    6.6%

Most occurring characters

Value  Count  Frequency (%)
0      12470  93.4%
1      881    6.6%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Destination_1
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 1
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      7670   57.4%
1      5681   42.6%

Most occurring characters

Value  Count  Frequency (%)
0      7670   57.4%
1      5681   42.6%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Destination_2
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      11769  88.2%
1      1582   11.8%

Most occurring characters

Value  Count  Frequency (%)
0      11769  88.2%
1      1582   11.8%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Destination_3
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      12470  93.4%
1      881    6.6%

Most occurring characters

Value  Count  Frequency (%)
0      12470  93.4%
1      881    6.6%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Destination_4
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      12895  96.6%
1      456    3.4%

Most occurring characters

Value  Count  Frequency (%)
0      12895  96.6%
1      456    3.4%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Destination_5
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 0
3rd row: 0
4th row: 0
5th row: 1

Common Values

Value  Count  Frequency (%)
0      12181  91.2%
1      1170   8.8%

Most occurring characters

Value  Count  Frequency (%)
0      12181  91.2%
1      1170   8.8%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_1
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_2
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_3
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13346  > 99.9%
1      5      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13346  > 99.9%
1      5      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_4
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13343  99.9%
1      8      0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13343  99.9%
1      8      0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_5
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      10925  81.8%
1      2426   18.2%

Most occurring characters

Value  Count  Frequency (%)
0      10925  81.8%
1      2426   18.2%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_6
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13348  > 99.9%
1      3      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13348  > 99.9%
1      3      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_7
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      12955  97.0%
1      396    3.0%

Most occurring characters

Value  Count  Frequency (%)
0      12955  97.0%
1      396    3.0%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_8
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
1      10490  78.6%
0      2861   21.4%

Most occurring characters

Value  Count  Frequency (%)
1      10490  78.6%
0      2861   21.4%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Additional_Info_9
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 104.4 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 13351
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      13350  > 99.9%
1      1      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  13351  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13351  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13351  100.0%

Interactions

Correlations

Auto

The auto setting is an easily interpretable pairwise column metric that chooses its method from the variable types of the two columns: categorical-categorical: Cramér's V; numerical-categorical: Cramér's V (using a discretized numerical column); numerical-numerical: Spearman's ρ. This configuration uses the most suitable method for each pair of columns.
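
The type-to-method mapping described above can be sketched as a small lookup. The function name `auto_method` and the string labels are hypothetical, chosen only for this illustration; they are not the profiling library's own API.

```python
def auto_method(type_a: str, type_b: str) -> str:
    """Return the association measure the auto setting describes for a type pair."""
    kinds = {type_a, type_b}
    if not kinds <= {"numerical", "categorical"}:
        raise ValueError(f"unsupported variable types: {type_a!r}, {type_b!r}")
    if kinds == {"numerical"}:
        # numerical-numerical pairs use Spearman's rho
        return "spearman"
    # categorical-categorical and numerical-categorical pairs both use
    # Cramér's V; in the mixed case the numerical column is discretized first.
    return "cramers_v"


print(auto_method("numerical", "numerical"))
print(auto_method("categorical", "numerical"))
```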

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
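
As a minimal sketch of that calculation, the code below ranks both variables (tied values share the mean of their positions, a common convention assumed here) and then divides the covariance of the ranks by the product of their standard deviations. The function names and the toy data are illustrative only.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values starting at i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def covariance(xs, ys):
    """Population covariance of two equally long sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def spearman_rho(xs, ys):
    """Covariance of the rank variables over the product of their standard deviations."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    return covariance(rx, ry) / sqrt(covariance(rx, rx) * covariance(ry, ry))


# A nonlinear but strictly increasing relationship still yields rho = 1.
xs = [1, 2, 3, 4, 5]
ys = [x ** 2 for x in xs]
rho = spearman_rho(xs, ys)
```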

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
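
The formula can be written out directly. The sketch below uses made-up data and a hypothetical function name, and also checks the invariance property by rescaling and shifting one variable (with a positive scale factor).

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sy = sqrt(sum((y - my) ** 2 for y in ys) / n)
    return cov / (sx * sy)


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 1.0, 4.0, 3.0, 5.0]
r = pearson_r(xs, ys)

# Changing the location and (positive) scale of xs leaves r unchanged.
r_rescaled = pearson_r([10 * x + 3 for x in xs], ys)
```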

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
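
A direct transcription of that definition is sketched below. It counts tied pairs as neither concordant nor discordant and divides by all pairs, which corresponds to the variant usually called tau-a; statistical libraries often default to a tie-adjusted variant instead, so results can differ when ties are present. The function name and data are illustrative.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The sign of the product tells whether both variables move the same way.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Six pairs in total: five concordant, one discordant (the 2nd and 3rd items).
tau = kendall_tau([1, 2, 3, 4], [1, 3, 2, 4])
```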

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
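
For illustration, the bias-corrected form as it is commonly stated can be sketched as follows: compute φ² = χ²/n from the contingency table, subtract the correction term (k-1)(r-1)/(n-1), and normalise with corrected row and column counts. This is a sketch under that assumption, not necessarily the report's exact implementation; the function name and data are made up.

```python
from collections import Counter
from math import sqrt


def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two equally long sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals, col_totals = Counter(xs), Counter(ys)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for a, row_count in row_totals.items():
        for b, col_count in col_totals.items():
            expected = row_count * col_count / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return sqrt(phi2_corr / denom) if denom > 0 else 0.0


# Two identical 0/1 indicator columns are perfectly associated.
flags = [0, 1] * 50
v_same = cramers_v_corrected(flags, flags)

# Two indicators whose values are evenly crossed are independent.
v_indep = cramers_v_corrected([0, 0, 1, 1] * 25, [0, 1, 0, 1] * 25)
```

Two indicator columns that always take the same value, as the identical value counts of Source_2 and Destination_1 in this report suggest may be the case, would score 1 under this measure.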

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram lets you more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
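
The two simplest of these views, the per-column nullity count and the pairwise nullity correlation, can be sketched in a few lines. The rows, column names and function names below are hypothetical and exist only for the example; missing cells are represented as `None`.

```python
from math import sqrt


def nullity_counts(rows, columns):
    """Number of missing (None) cells per column."""
    return {c: sum(row[c] is None for row in rows) for c in columns}


def nullity_correlation(rows, col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if row[col_a] is None else 1.0 for row in rows]
    b = [0.0 if row[col_b] is None else 1.0 for row in rows]
    n = len(rows)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A column that is always present or always missing has no variation.
        return None
    return cov / sqrt(var_a * var_b)


# Hypothetical rows: col_a and col_b are always missing together.
rows = [
    {"col_a": 1.0, "col_b": "x", "col_c": 5},
    {"col_a": None, "col_b": None, "col_c": 7},
    {"col_a": 2.0, "col_b": "y", "col_c": None},
    {"col_a": None, "col_b": None, "col_c": 9},
]
counts = nullity_counts(rows, ["col_a", "col_b", "col_c"])
corr_ab = nullity_correlation(rows, "col_a", "col_b")
```

When only one column contains missing values, every pairwise nullity correlation involving a fully populated column is undefined, which the sketch reports as `None`.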

Sample

First rows

,df_index,Total_Stops,Price,Date,Month,Year,Arrival_Hour,Arrival_Min,Dep_Hour,Dep_Min,Duration_in_minutes,Airline_1,Airline_2,Airline_3,Airline_4,Airline_5,Airline_6,Airline_7,Airline_8,Airline_9,Airline_10,Airline_11,Source_1,Source_2,Source_3,Source_4,Destination_1,Destination_2,Destination_3,Destination_4,Destination_5,Additional_Info_1,Additional_Info_2,Additional_Info_3,Additional_Info_4,Additional_Info_5,Additional_Info_6,Additional_Info_7,Additional_Info_8,Additional_Info_9
0,0,0.0,3897.0,24,3,2019,1,10,22,20,170,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0
1,1,2.0,7662.0,1,5,2019,13,15,5,50,445,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
2,2,2.0,13882.0,9,6,2019,4,25,9,25,1140,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0
3,3,1.0,6218.0,12,5,2019,23,30,18,5,325,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
4,4,1.0,13302.0,1,3,2019,21,35,16,50,285,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0
5,5,0.0,3873.0,24,6,2019,11,25,9,0,145,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
6,6,1.0,11087.0,12,3,2019,10,25,18,55,930,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,0,0,0
7,7,1.0,22270.0,1,3,2019,5,5,8,0,1265,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0
8,8,1.0,11087.0,12,3,2019,10,25,8,55,1530,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,0,0,0
9,9,1.0,8625.0,27,5,2019,19,15,11,25,470,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0

Last rows

,df_index,Total_Stops,Price,Date,Month,Year,Arrival_Hour,Arrival_Min,Dep_Hour,Dep_Min,Duration_in_minutes,Airline_1,Airline_2,Airline_3,Airline_4,Airline_5,Airline_6,Airline_7,Airline_8,Airline_9,Airline_10,Airline_11,Source_1,Source_2,Source_3,Source_4,Destination_1,Destination_2,Destination_3,Destination_4,Destination_5,Additional_Info_1,Additional_Info_2,Additional_Info_3,Additional_Info_4,Additional_Info_5,Additional_Info_6,Additional_Info_7,Additional_Info_8,Additional_Info_9
13341,2661,2.0,NaN,27,3,2019,4,25,19,10,1995,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0
13342,2662,0.0,NaN,21,5,2019,15,25,13,55,90,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,1,0
13343,2663,1.0,NaN,12,5,2019,7,45,23,30,495,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
13344,2664,1.0,NaN,15,6,2019,1,30,15,15,615,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0
13345,2665,0.0,NaN,21,6,2019,0,15,22,45,90,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,0,0
13346,2666,1.0,NaN,6,6,2019,20,25,20,30,1435,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
13347,2667,0.0,NaN,27,3,2019,16,55,14,20,155,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0
13348,2668,1.0,NaN,6,3,2019,4,25,21,50,395,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0
13349,2669,1.0,NaN,6,3,2019,19,15,4,0,915,1,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0
13350,2670,1.0,NaN,15,6,2019,19,15,4,55,860,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0